Frequent Itemsets Used in Mining of Train Delays
Similar resources
Efficiently Mining Maximal Frequent Itemsets
We present GenMax, a backtrack search based algorithm for mining maximal frequent itemsets. GenMax uses a number of optimizations to prune the search space. It uses a novel technique called progressive focusing to perform maximality checking, and diffset propagation to perform fast frequency computation. Systematic experimental comparison with previous work indicates that different methods have...
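As a rough illustration of what backtrack search for maximal frequent itemsets involves, here is a minimal sketch. It is not GenMax: it uses plain tidset intersections instead of diffset propagation and a naive subset scan instead of progressive focusing. The function name `maximal_frequent_itemsets` and its parameters are hypothetical, chosen only for this example.

```python
def maximal_frequent_itemsets(transactions, min_support):
    """Return maximal frequent itemsets via a simple depth-first backtrack search.

    Assumption: min_support is an absolute transaction count.
    """
    # Vertical layout: item -> set of ids of the transactions containing it.
    tidsets = {}
    for tid, items in enumerate(transactions):
        for item in items:
            tidsets.setdefault(item, set()).add(tid)

    frequent_items = sorted(i for i, t in tidsets.items() if len(t) >= min_support)
    maximal = []

    def is_subsumed(itemset):
        # Naive maximality check: scan every maximal set found so far.
        return any(itemset <= m for m in maximal)

    def backtrack(prefix, prefix_tids, candidates):
        extended = False
        for idx, item in enumerate(candidates):
            if prefix_tids is None:
                new_tids = tidsets[item]
            else:
                new_tids = prefix_tids & tidsets[item]
            if len(new_tids) >= min_support:
                extended = True
                backtrack(prefix | {item}, new_tids, candidates[idx + 1:])
        # A non-empty prefix with no frequent extension is maximal unless an
        # earlier branch already produced a superset of it.
        if not extended and prefix and not is_subsumed(prefix):
            maximal.append(prefix)

    backtrack(frozenset(), None, frequent_items)
    return maximal
```

Calling it on a list of item sets with an absolute support threshold returns a list of `frozenset` objects; the optimizations named in the abstract would replace the two naive steps marked in the comments.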
Mining Train Delays
The Belgian railway network has a high traffic density with Brussels as its gravity center. The star-shape of the network implies heavily loaded bifurcations in which knock-on delays are likely to occur. Knock-on delays should be minimized to improve the total punctuality in the network. Based on experience, the most critical junctions in the traffic flow are known, but others might be hidden. ...
Set Overlap in Mining of Frequent Itemsets
An important module of soft computing methods is the set overlap operation. If a query set is tested against a large pool of source sets, signature-based or inverted-file methods are used to reduce the cost of the operation. The paper introduces a modified version of the inverted-file approach, which yields the lowest costs for sparse input samples, i.e. where the number of records containing a...
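To make the inverted-file idea concrete, the following is a minimal sketch of the plain (unmodified) approach: each item maps to the ids of the source sets that contain it, so a query only touches the posting lists of its own items. This is not the modified version the paper proposes; the names `build_inverted_index` and `overlap_counts` are hypothetical.

```python
from collections import defaultdict


def build_inverted_index(source_sets):
    """Map each item to the ids of the source sets that contain it."""
    index = defaultdict(list)
    for set_id, items in enumerate(source_sets):
        for item in set(items):  # de-duplicate in case a list is passed
            index[item].append(set_id)
    return index


def overlap_counts(index, query):
    """Return {set_id: size of overlap with query} for sets with non-empty overlap."""
    counts = defaultdict(int)
    for item in set(query):
        # .get avoids creating empty posting lists for unseen items.
        for set_id in index.get(item, ()):
            counts[set_id] += 1
    return dict(counts)
```

The cost of a query is proportional to the total length of the posting lists it visits, which is why this kind of approach tends to suit sparse data.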
Optimizing inductive queries in frequent itemsets mining
Let Q = {Q1, . . . , Qn} be a set of past queries and let R = {R1, . . . , Rn} be their results. Moreover, let Q0 be a query newly submitted to the system and let R0 be its result. The task of optimizing the extraction of R0 using the knowledge provided by Q and R has been addressed with two distinct approaches. In the first approach we search for a query Qi ∈ Q, such that R0 ⊆ Ri (in such a ...
Distributed Frequent Itemsets Mining in Heterogeneous Platforms
Large numbers of datasets of different sizes are naturally distributed over the network. In this paper we propose a distributed algorithm for frequent itemsets generation on heterogeneous clusters and grid environments. In addition to the disparity in the performance and the workload capacity in these environments, other constraints are related to the distribution of the datasets and their nature, an...
Journal
Journal title: International Journal of Computer Science and Informatics
Year: 2013
ISSN: 2231-5292
DOI: 10.47893/ijcsi.2013.1097